Picture for Feng Tian

Feng Tian

Jack

StepAudio 2.5 Technical Report

Add code
May 22, 2026
Viaarxiv icon

HFP-SAM: Hierarchical Frequency Prompted SAM for Efficient Marine Animal Segmentation

Add code
Mar 13, 2026
Viaarxiv icon

DMESR: Dual-view MLLM-based Enhancing Framework for Multimodal Sequential Recommendation

Add code
Feb 14, 2026
Viaarxiv icon

Interactive Spatial-Frequency Fusion Mamba for Multi-Modal Image Fusion

Add code
Feb 04, 2026
Viaarxiv icon

Stabilizing Diffusion Posterior Sampling by Noise--Frequency Continuation

Add code
Jan 30, 2026
Viaarxiv icon

Training LLMs with Fault Tolerant HSDP on 100,000 GPUs

Add code
Jan 30, 2026
Viaarxiv icon

RSATalker: Realistic Socially-Aware Talking Head Generation for Multi-Turn Conversation

Add code
Jan 15, 2026
Viaarxiv icon

The Llama 4 Herd: Architecture, Training, Evaluation, and Deployment Notes

Add code
Jan 15, 2026
Viaarxiv icon

TCDE: Topic-Centric Dual Expansion of Queries and Documents with Large Language Models for Information Retrieval

Add code
Dec 19, 2025
Figure 1 for TCDE: Topic-Centric Dual Expansion of Queries and Documents with Large Language Models for Information Retrieval
Figure 2 for TCDE: Topic-Centric Dual Expansion of Queries and Documents with Large Language Models for Information Retrieval
Figure 3 for TCDE: Topic-Centric Dual Expansion of Queries and Documents with Large Language Models for Information Retrieval
Figure 4 for TCDE: Topic-Centric Dual Expansion of Queries and Documents with Large Language Models for Information Retrieval
Viaarxiv icon

Spatial-Frequency Enhanced Mamba for Multi-Modal Image Fusion

Add code
Nov 10, 2025
Viaarxiv icon